Antipersistent binary time series 
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Completely antipersistent binary time series are sequences in which every time that an A^-bit 
string II appears, the sequence is continued with a different bit than at the last occurrence of /i. 
This dynamics is phrased in terms of a walk on a DeBruijn graph, and properties of transients and 
cycles are studied. The predictability of the generated time series for an observer who sees a longer 
or shorter time window is investigated also for sequences that are not completely antipersistent. 



I. INTRODUCTION 

The analysis and generation of time series has been of 
interest to physicists in different fields: it yields insight 
into the dynamics of chaotic systems [^|] and is used to 
study such diverse systems as the climate and the heart- 
beat linguistics Q|, and, in recent times, financial 
markets [^||. The prediction of time series generated 
by complex physical systems can be of immense impor- 
tance, as evidenced by the efforts put into improving the 
weather forecast. One of the most common aspects of 
time series is long-term memory, or persistence. This 
paper, however, will deal with the opposite behaviour, 
namely antipersistence. 

Antipersistence is the tendency of a time series to show, 
at one point, the opposite behaviour than at some point 
in the past. This concept is usually applied to continuous 
time series , but has received little attention due to the 
dearth of natural phenomena that exhibit antipersistence 
(for an exception, see |^). 

In this paper, antipersistence in binary time series is 
defined as follows: a sequence of bits is antipersistent on 
a scale of if a string n oi N bits length is likely to 
be followed by the opposite bit than the last time that 
/i appeared. This property was observed, for example, 
in some parameter regimes of the conventional Minority 
Game §,0 and in various alternative models based on 
the idea of the Minority Game [0 . 

This paper will first deal with completely antipersis- 
tent time series from the point of view of discrete dy- 
namical systems. Properties o f cy cles and transients will 
be studied in Sections [II and IV, respectively. Concepts 



and results from graph theory turn out to yield much 
insight. 

In Section the sequence is treated as an anti- 
predictable sequence for a special prediction algorithm, 
and antipersistence on one time scale is related to proper- 
ties on other time scales. Then a more general stochastic 
model is introduced, and the effects of stochasticity on 
the previous results are discussed. 



II. THE MODEL 

An infinitely long, completely antipersistent binary 
time series of bits Sr G {0, 1} can by definition be gen- 
erated in the following fashion: at time t, the history 



fit denotes the binary representation of the last N bits 
St-N+i, ■ ■ ■ ,st- For each of the 2^ possible histories fi 
there is an entry G {0, 1} in a decision table At, which 
also depends on t. At each time step. 



• a new bit is generated by taking the table entry cor- 
responding to the current history /i( (which I will 



also refer to as "the pattern"): st+i 



,A»t. 



• the history is updated: Ht+i — {2 fit + St+i) mod 
2^; i.e., all bits are shifted one position to the 
left (multiplication with 2), the newly generated 
bit is added, and the oldest (most significant) bit 
is dropped (division modulo 2^); 



• the table entry that was used for making the 
decision is changed, such that the sequence will be 
continued with the opposite decision when the pat- 



tern fit occurs the next time: 
other entries remain unchanged. 
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This last point is especially important. It means that 
the table entries are dynamical variables, and the state 
of the dynamical system is determined by the current 
pattern and the state of the decision table. Fig. ^ shows 
two steps of the described dynamics of such a system for 
N = 2. 
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FIG. 1. An example of a decision table with N = 2 and two 
steps of the dynamics. Boldface numbers indicate the current 
history and the table entry used for continuing the sequence; 
italic numbers denote the last table entry that was changed. 
The sequence generated in this example is 010 .... 
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The model can also be considered from a graph- 
theoretical perspective: each pattern corresponds to 
a node on a directed DeBruijn graph of order N (see Fig. 
^ for an example of such a graph) . Each node oljviously 
has two edges entering it, coming from the two possible 
predecessors, which I denote and ^fi. For example, 
if = 1100, the possible predecessors are °/i — 0110 
and ^fj, — 1110. Each edge also has two outgoing edges, 
leading to the two possible successors /i° and fi^ (for 
/i = 1100, the successors are /Lt° = 1000 and fi^ = 1001). 
The graph is connected, since one can reach each node 
from any other node in a maximum of N steps by taking 
the appropriate exit edges. 




FIG. 2. The directed DeBruijn graph of order 3: Nodes 
represent binary strings of length 3, edges lead to strings that 
are generated by shifting the current string one position and 
adding either or 1 as the new least significant bit. 

In our model, at any time, only one of the exits leaving 
each node is labelled "active" - the one corresponding to 
the table entry a^. A time step consists of travelling from 
the current node to the next along the active exit, then 
"burning the bridge" , i.e. labelling the previously active 
exit inactive and vice versa. 




FIG. 3. Example for an irreversible situation on a graph of 
order 2. Active exits are denoted by solid lines, inactive ones 
by dashed lines. The bold circle indicates the node currently 
visited. Both the upper left configuration (which happens to 
be part of a cycle, and corresponds to the example in Fig. |l| 
) and the lower one (which is part of a transient) lead to the 
configuration on the right. 



III. PROPERTIES OF CYCLES 

The introduced dynamics is deterministic, and the 
combined system of pattern and table has a finite num- 
ber = 2^ • 2^^ ) of different states, so the dynamics 
necessarily leads into a cycle eventually. The dynamics 
is irreversible: if a currently visited node has two inactive 
entrances, it is impossible to tell which path the system 
took to get to its current state (for an example, see Fig. 

This means that not every state can be part of a cy- 
cle, so we will have to consider the necessary conditions 
for being in a cycle. I will show, step by step, that all 
cycles are of length 2 • 2^ and touch all nodes exactly 
twice. 

Some of the proofs that now follow are redundant; on 
the other hand, they help to understand the properties 
of the system, and some of them are applicable to gener- 
alizations of the problem, whereas others are not. 

Let us assume that at time 0, the system is already 
moving on a cycle of length /. We count the number of 
times that a history fi has occurred between time and 
time f by a visit number v^. Since the definition of a 
cycle is that after I steps the system must be in the same 
state again, it is necessary that is even for all ii, since 
the table entries return to their original state only 
after even numbers of visits. 

Also, all possible nodes are part of the cycle. Let us 
prove this by assuming the opposite, namely that there 
are some nodes that are not touched by the cycle. Since 
the graph is connected, there must be unused connections 
between the part of the graph involved in the cycle and 
the part that is left out. But as we have seen in the para- 
graph before, the visit number of each of the nodes that 
are actually part of the cycle must be at least 2 (larger 
than 0, and even), so each of its two exits is used, includ- 
ing the one leading to the part of the graph supposedly 
not included in the cycle. This is a contradiction, so all 
nodes are involved. 

An even stronger statement is possible: the total num- 
ber of visits to the predecessors °/x and ^/i of fi must be 
equal to twice the number of visits to fj,, since exactly 
half of the visits they get are followed by /Lt, while the 
other half is followed by the so-called conjugate state fi 
of /i. (For example, 001 is the conjugate state of 000.) 
Thus, we have 



iv''+v'')/2 foraU/i. 



(1) 



This can be written as a linear equation for an eigenvector 
with eigenvalue 1 of a matrix M, with entries Oj/^ = 1/2 
if /X is a possible successor of v and a^^ = otherwise. 
For example, for N = 2, the set of Eqs. (|l|) looks as 
follows: 
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Since the sum of columns in matrix M is always 1, and 
the individual entries are > 0, and it describes transi- 
tions on a connected graph, we can apply results from 
the theory of stochastic matrices to state that it has one 
unique eigenvector with eigenvalue 1 ]T^ , and we easily 
guess that = const fulfills Eq. (|l]). That means that 
in a cycle, all states are visited with the same frequency. 

The next step is to show that in a cycle, each node is 
exactly visited twice, i.e., all cycles of length 4-2^, 6-2^, 
and so on, are in fact two, three or more repetitions of 
a 2 ■ 2''^-cycle. Again, assume the system is moving on a 
cycle. If this cycle were truly longer than 2-2^, there 
must, at the point t = 2 ■ 2^, be nodes that have been 
visited three or more times while others have not been 
visited for the second time - the visit numbers must add 
up to 2 • 2^, and if all visit numbers were equal to 2, the 
cycle would be complete. More specifically, there must 
be an earlier time when all visit numbers are either 
0, 1, or 2, and one node is about to be visited for the 
third time. It suffices to show that this cannot happen 
to prove that the cycle cannot be longer than 2-2^. 

The third visit to a node /i with v'^ = 2 cannot come 
from a predecessor (let us say, "^/i) with a visit number 
of w = 0, for the obvious reason that this predecessor 
has not been visited yet. It also cannot come from a pre- 
decessor with V — 1: a v'^ — 2, either it must have 
been visited before from (which it cannot - the pre- 
decessor has only had one visit so far), or it must have 
been reached twice from ^/i - this is impossible as well, 
since it means that w > 3. For similar reasons, we can 
exclude a visit from a node with v ^ —2: either v ^ >i 
as before, or the first visit to '^/i led to /i - then the sec- 
ond cannot. This means that all nodes must receive two 
visits - thus finishing a cycle ~ before one of them can 
be visited for the third time. The question arises why 
this line of reasoning does not hold true during the tran- 
sient. The key lies in the observation that the first node 
to receive three visits is the node where the system was 
started - it did not get its first visit from anywhere on 
the graph, so the arguments are not applicable. 

Using all previous conclusions, the cycles turn out to 
be solutions to a well-studied combinatorial problem, and 
the number of different cycles can be found in the litera- 
ture: since, during a cycle, each string /i appears exactly 
once followed by each of ist successors, the cycle is a se- 
quence in which each A^-f- 1-bit pattern, i.e. each node on 
the DeBruijn graph of order A^-|- 1, appears exactly once. 
Such a sequence is a Hamiltonian circuit, also known as 
a full cycle, on the N -\- 1-graph. For a review on the 
properties of these cycles, consult Ref. |l^. One of the 
earliest and most central results on the topic is the num- 
ber of different cycles, which is 2^ -(A^+i) [J4| 

All possible full cycles on the A^ -I- 1-graph can be gen- 
erated by the antipersistent walk on the A-graph: write 
down the desired sequence starting at some arbitrary 
point, look for the first occurrence of each A^-bit pat- 
tern ji, and set the corresponding table entry to the bit 



that follows it. Starting the antipersistent walk at the 
first pattern of the desired sequence, the antipersistent 
walk will reproduce it. 



Since all cycles are of length 2-2^, a total of 2 • 2^ x 
22 -(N+i) _ ) states is part of a cycle. As men- 
tioned before, the total number of possible states is O = 
2^-2(2™), which means that a fraction of 2(2" = 
of possible states is part of a cycle. 



IV. PROPERTIES OF THE TRANSIENT 



The length r of the transient is the number of steps 
taken before the system enters a cycle. The distribu- 
tion of transient lengths is less accessible to analytical 
approaches, but easy to measure in computer simula- 
tions, either by complete enumeration for small systems 
or by Monte Carlo for larger ones. The following picture 
emerges, as seen in Fig. 
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FIG. 4. Distribution of transient lengths r, rescaled by the 
cycle length 2^+\ 

The probabilty for transient length r = is just the 
probability of hitting a cycle right away, and thus the 
fraction of state space filled with cycles. As mentioned, 
this is equal to 2~^. 

The probability distribution is more or less flat for 
1 < T < 2^+^, the cycle length. From normalization 
constraints, it follows that p{t) « 2~(^"'"^^ in that range. 

Near t = 2^+^, there is an exponential drop reminis- 
cent of a phase transition, which gets steeper with in- 
creasing A. Even for small A^, no transients longer than 
2^+2 -j^QyQ been observed. 
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V. ANTIPERSISTENCE ON DIFFERENT 
TIMESCALES 

Consider the following prediction algorithm: an ob- 
server looks at A'^ohs-bit strings from a binary time series, 
writes down the bit that followed the pattern in the ap- 
propriate entry of his decision table (which, of course, 
has 2^°''= rows), and predicts that when that pattern oc- 
curs the next time, it will be followed by that same bit 
written in his table. 

This algorithm, which could be labelled "blind reliance 
on recent experience" , works well for persistent time se- 
ries, and it is similar in spirit to what we all do instinc- 
tively - similar situations usually lead to similar con- 
sequences. An interesting quantity is the success rate 
s{N, Nobs) of this prediction algorithm when it predicts a 
completely antipersistent walk on the iV-graph. For sim- 
plicity's sake, we consider the long-time limit, in which 
both the generator and the observer move on a cycle. 

If the observer looks at the same time window as the 
generator {Nobs = N), it is obvious that the success rate 
will be - since each pattern is continued with alter- 
nating bits on each visit. In that sense, the generated 
time series is anti-predictable for this specific prediction 
algorithm (for more on anti-predictable sequences, see 

For an observer with a slightly larger window, the pic- 
ture changes: as mentioned above, the antipersistent cy- 
cle corresponds to a Hamiltionian cycle on the + 1- 
graph, which is completely persistent and predictable 
with 100% accuracy. For even larger Nobs, the antipersis- 
tent cycle looks like a closed path which includes only a 
fraction of 2^~(^°''»+i) of nodes on the iVob^-graph. Pre- 
diction is again 100% reliable, and the observer does not 
even need all of his storage capacity to handle the cases 
that occur. 

If the observer has a shorter time window than the 
generator, more than one of the generator's patterns will 
affect the same table entry for the observer. For exam- 
ple, an — 1-bit pattern v corresponds to either of the 
iV-bit strings Ov or Iv, both of which occur twice in the 
iV-cycle, each time followed by a different successor. The 
success rate of the predictor depends on the sequence in 
which these combinations occur; if each permutation of 
Oi^O, Oi^l, li^O and li^l has the same probability, the suc- 
cess rate for all patterns is the average over the different 
permutations. Figure ^ shows that this average is 1/3 for 
Nobs ^N-1. 

For Nobs = N — 2, all permutations of eight combina- 
tions of predecessors and successors have to be taken into 
account - a task best left to computer algebra programs, 
which yield {s{N,N — 1)) = 3/7, in excellent agreement 
with simulations (see Fig. ^. Larger differences in the 
time window are beyond even the scope of computer pro- 
grams; however, it can be argued that for larger N—Nobs, 
the visits to the A'^ofcs-nodes become more and more ran- 
dom, and s{N,Nobs) will tend to 1/2. 
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FIG. 5. Possible permutations of predecessors and succes- 
sors of a pattern /i on a cycle. The error rate, given in the last 
column, is the rate of flips between and 1 in the sequence 
of successors. 



VI. STOCHASTIC ANTIPERSISTENCE 

All of these observations relied on the fact that the 
generator is on a cycle with well-known properties. It is 
thus interesting to ask how stable these results are if the 
sequence is not completely antipersistent. The simplest 
generalization is to introduce a probabilty p for chang- 
ing the table entry/ exit when visiting a node: p ~ 1 
reproduces the completely antipersistent walk; p = is 
equivalent to using a constant (quenched) decision table, 
and p = 1/2 generates a completely random time series. 

A first intuitive guess would be that even a small de- 
viation from deterministic dynamics completely destroys 
all predictability: after all, on a path of length 2^+^, 
there are on the average (1 —p)2'^~^^ occasions where the 
the sequence is continued persistently, thus leaving the 
cycle. Indeed, a single "error" is usually enough to move 
the system from one cycle to another; however, much of 
the local structure remains untouched. It turns out that 
the functions s{N, Nobs,p) of prediction rates converge 
for large N (meaning roughly iV > 12) to a set of curves 
that depend only on p and N — Nobs , which is displayed 
in Fig. ^. 

The limit values for p = 1 have been explained above, 
and they are approached continuosly for p — > 1. For 
p — 0.5, the curves intersect at s = 0.5 - no prediction 
beyond guessing is possible. For small p, all curves con- 
verge to 1: the system is dominated by short loops in 
which only a small fraction of the possible states partic- 
ipate, and those are predicted with high accuracy. 

Interestingly, between p = 0.5 and roughly p ~ 0.85, 
all shown curves are below 0.5, meaning that even ob- 
servers with longer memory predict the sequence with 
less than 50% accuracy. I will give an analytical argu- 
ment why this is the case for Nobs = + 1. An A^ -I- 1-bit 
pattern is a combination of an A^-bit pattern fi and one 
of its predecessors, let us say '^/i, whereas the companion 
state i) is a combination of /i and the other predecessor 
^/i. A visit to either v or P switches the exit of with 
probability p. Consider two subsequent visits to ly, with 
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some number I of visits to between them. The proba- 
bihty s{p, N, N +1) of continuing with the same bit after 
these two visits is a sum of two probabihties: either the 
exit of /i was switched upon leaving v the first time and 
then switched an odd number of times during the I visits 
to j), or it was not switched the first time and switched 
an even number of times in between. Given p and the 
probabihty TTi{p, N) of having I intermediate visits to v, 
one then obtains by basic combinatorics 



oo ^ 

s(p, iV, TV + 1) = ^ -TT, (p, iV) [1 + (1 - 2p)'+i] . 



(3) 



Unfortunately, tt; {p, N) does not seem to be analytically 
accessible for general p. It can be measured in simula- 
tions, and the accuracy of Eq. (||) verified (see Fig. ||); 
also, for p = 1/2, since the system does a completely 
random walk on the graph, one gets the simple distri- 
bution 7r;(l/2,7V) = 2~('+i). Assuming that this distri- 
bution does not change discontinuously near p = 1/2, 
Eq. yields the approximation s{l/2 + Sp, N, N +1) « 
1/(2 + 26p) « (1/2)(1 - Sp). This is obviously < 1/2 for 
Sp > 0, i.e., p > 1/2. 




FIG. 6. Success rate of an observer keeping a table of the 
recent occurrence of A''oi,s-bit strings and the respective fol- 
lowing bit, for TV = 16. p is the probability of flipping the 
exit in the generating graph. The symbols labelled "theory" 
were calculated using Eq. (^ with approximate probabihties 
TTi (p, A'^) taken from the simulation itself. 



1 - s{p, N,N+1)^ 
l-s{{l~s{p,N,N + l)},N+l,N + 2). (4) 

It is thus not sufficient to give a single parameter p, or 
1 — s, for some N in order to characterize the behaviour of 
a time series completely and to calculate its predictabilty 
on other scales of observation. The scale on which the 
dynamics work is important as well. 



VII. RESULTS AND CONCLUSION 

I introduced a deterministic algorithm that generates a 
binary time series that is completely antipersistent with 
respect to strings of length N. After a short transient, the 
algorithm runs into cycles of length 2-2^, in which each 
string appears exactly twice. These cycles correspond to 
Hamiltonian paths on a DeBruijn graph of order -I- 1. 

The cycle length is much larger than the typical cycle 
length of a graph with fixed decision tables. This seems 
typical for antipredictable sequences: sequences that can 
be predicted with 100% accuracy by some prediction al- 
gorithm usually do not require adaptation of the algo- 
rithm's parameters, whereas antipredictable sequences 
explore the combined phase space of the sequence and 
the generating algorithm, allowing for more, longer, and 
more complex cycles. In this case, however, the dynam- 
ics allow for fairly simple proofs of the properties of the 
cycles. 

Observers that keep track of the most recent occur- 
rence of A^ohs-bit strings can predict the completely an- 
tipersistent cycle with 100% accuracy if Nobs > A, and 
with less than 50% success rate if Nobs < A. If the 
stochasticity is introduced by means of a probability p of 
flipping the exit edges, the success rate even of observers 
with Nobs > N can drop below 50%, which shows that 
larger memory does not necessarily give better results. 
The rate of antipersistence on one scale is not sufficient 
to calculate the rate for other scales. 
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The error rate 1 — s{p, N, Nobs) is a measure of the an- 
tipersistence of the time series on the scale Nobs - for 
Nobs = A, 1 — s is completely equivalent to the antiper- 
sistence parameter p of the underlying dynamics. How- 
ever, even if s{p, N,N + 1) could be calculated, it would 
not be possible to recursively apply this function to find 
the antipersistence on the scales A^ -I- 2, A^ -I- 3 etc. In 
other words. 



[1] F. Takens, in Dynamical Systems and Turbulence, edited 
by D. A. Rand and L. S. Young (Springer, Berlin, 1981), 
Chap. Detecting strange attractors in turbulence, pp. 
365-381. 



5 



[2] T. Sauer, J. A. Yorke, and M. Casdagli, J. Stat. Phys. 

65, 579 (1991). 
[3] A. Bunde and J. W. Kantelhardt, Physikalische Blatter 

57, 49 (2001). 

[4] I. Kanter and D. A. Kessler, Phys. Rev. Lett. 74, 4559 

(1995). 

[5] N. Johnson et al, |cond-mat/0105305 (unpublished). 
[6] For further references on time series analysis in fi- 
nancial markets, consult the Econ ophysics homepage. 



http : // www. unifr . ch / econophysics / 



[7] H. Kantz and T. Schreiber, Nonhnear time series analy- 
sis (Cambridge University P ress, Cambridge, UK , 1997). 

[8] M. Ausloos and K. Ivanova, [cond- mat/0103367 (unpub- 
lished). 



lished) . 

[10] R. Savit, R. Manuca, and R. Riolo, Phys. Rev. Lett. 82 
2203 (1999). 



[9] M. Marsili and D. ChaUet, cond-mat/0004376 (unpub- 



[11] G. Reents, R. Metzler, and W. Kinzel, |cond-mat /0007351 
(unpublished) . 

[12] W. Feller, An Introduction to Probability Theory and 
its Applications (John Wiley & Sons, New York, 1970), 
Vol. 1. 

[13] H. Fredricksen, SIAM Review 24, 195 (1982). 

[14] C. Flye Sainte-Marie, L'Intermediaire des mathemati- 

ciens 1, 107 (1894). 
[15] H. Zhu and W. Kinzel, Neural Computation 10, 2219 

(1998). 

[16] R. Metzler, W. Kinzel, L. Ein-Dor, and L Kanter, Phys. 
Rev. E 63, 056126 (2001). 



6 



